cltk's Repositories

100 repositories

alatinparser
ALP (A Latin Parser) is a syntactic parser for a small subset of classical Latin.
⭐ 3 🌐 Public
ang_models_cltk
No description
⭐ 6 🌐 Public
annotations
A tool for annotating texts using Draft.js
⭐ 13 🌐 Public
arabic_morphology_quranic-corpus
No description
⭐ 2 🌐 Public
arabic_text_perseus
corpus for Classical arabic
⭐ 1 🌐 Public
arabic_text_quranic_corpus
No description
⭐ 0 🌐 Public
bengali_text_wikisource
No description
⭐ 3 🌐 Public
capitains_corpora_converter
Converts CapiTainS-based Repository ( http://capitains.github.io ) to JSON for CLTK
⭐ 0 🌐 Public
capitains_text_corpora
Processed docs from capitains_corpora_converter
⭐ 1 🌐 Public
chinese_text_cbeta_01
Chinese Buddhist scriptures from CBETA
⭐ 0 🌐 Public
chinese_text_cbeta_02
Chinese Buddhist scriptures from CBETA
⭐ 1 🌐 Public
chinese_text_cbeta_indices
Indices to the CBETA corpus
⭐ 4 🌐 Public
chinese_text_cbeta_taf_xml
No description
⭐ 0 🌐 Public
chinese_text_cbeta_txt
No description
⭐ 0 🌐 Public
chinese_text_sheffield
Texts from the Sheffield Corpus of Chinese
⭐ 0 🌐 Public
chinese_text_wikisource
No description
⭐ 0 🌐 Public
classical_arabic_models
Statistical models for Classical Arabic
⭐ 0 🌐 Public
cltk
The Classical Language Toolkit
⭐ 880 🌐 Public
cltk.github.io
Static website for CLTK organization, built with Jekyll
⭐ 1 🌐 Public
cltkv1
Experimental repo for new API CLTK
⭐ 1 🌐 Public πŸ“¦ Archived
cltk_api
RESTful API for the CLTK
⭐ 13 🌐 Public πŸ“¦ Archived
cltk_api_v2
No description
⭐ 1 🌐 Public
cltk_community_api
No description
⭐ 1 🌐 Public
cltk_docker
Docker script for cltk
⭐ 6 🌐 Public
cltk_frontend
Reading environment connecting to API from cltk/cltk_api repo
⭐ 20 🌐 Public πŸ“¦ Archived
cltk_grc_liddell_scott_intermediate
No description
⭐ 1 🌐 Public
cltk_lat_lewis_elementary_lexicon
No description
⭐ 0 🌐 Public
cltk_non_zoega_dictionary
No description
⭐ 0 🌐 Public
cltk_vagrant
Vagrant and other bootstrap methods for CLTK core and CLTK API
⭐ 0 🌐 Public
coptic_text_scriptorium
Public repository for Coptic SCRIPTORIUM Corpora Releases
⭐ 0 🌐 Public
csel_openphilology_corpus
CSEL orpus based on https://github.com/OpenGreekAndLatin/csel-dev/
⭐ 0 🌐 Public
english_texts_wikisource
No description
⭐ 3 🌐 Public
enm_models_cltk
Models for Middle English provided by CLTK
⭐ 1 🌐 Public
escriptorium-deploy
Scripts to deploy the eScriptorium OCR system
⭐ 2 🌐 Public
extras
Place for modules left out of transition to v1.0
⭐ 0 🌐 Public
First1KGreek
XML files for the works in the First Thousand Years of Greek Project.
⭐ 3 🌐 Public
french_lexicon_cltk
Old French lexicon from wikisource.org
⭐ 1 🌐 Public
french_text_wikisource
Collected texts from wikisource.org
⭐ 2 🌐 Public
fro_models_cltk
No description
⭐ 0 🌐 Public
germanic_models_cltk
No description
⭐ 1 🌐 Public
gmh_models_cltk
Stored data for tagging Middle High German
⭐ 1 🌐 Public
gml_models_cltk
No description
⭐ 1 🌐 Public
grc_models_cltk
Trained taggers, tokenizers, etc. for the CLTK
⭐ 9 🌐 Public
grc_software_tlgu
Utility for converting TLG & PHI corpora to Unicode
⭐ 7 🌐 Public
grc_text_perseus
Collected Greek files from the Perseus Digital Library
⭐ 11 🌐 Public
grc_text_tesserae
Plaintext files with Ancient Greek texts from the Tesserae Project
⭐ 6 🌐 Public
greek_lexica_perseus
Lexica and lemmata for the Ancient Greek language, from various sources
⭐ 20 🌐 Public
greek_ner_v1
No description
⭐ 0 🌐 Public
greek_pos_edit_xenophon_anabasis
A human–editable version of a POS–tagged text of Xenophon's Anabasis
⭐ 2 🌐 Public
greek_proper_names_cltk
A list of ~144K Classical Greek proper names
⭐ 4 🌐 Public
greek_software_tlgu_python
A python wrapper for greek_software_tlgu
⭐ 1 🌐 Public
greek_text_lacus_curtius
Collected Greek Texts from Lacus Curtius
⭐ 0 🌐 Public
greek_training_set_sentence_cltk
Training sets and tokenizer for the Classical Greek language, for use with CLTK
⭐ 5 🌐 Public
greek_treebank_perseus
Greek treebank from the Perseus Digital Library
⭐ 12 🌐 Public
greek_word2vec_cltk
Greek Word2Vec models
⭐ 6 🌐 Public
gujarati_text_wikisource
Collected Gujarati texts from wikisource.org
⭐ 1 🌐 Public
hebrew_text_sefaria
Structured Jewish texts and metadata exported from Sefaria's database.
⭐ 2 🌐 Public
hindi_text_ltrc
Corpus of Raw text for Classical Hindi
⭐ 3 🌐 Public
iswoc-treebank
Official releases of the ISWOC treebank
⭐ 0 🌐 Public
javanese_text_gretil
extracted the old javanese text.
⭐ 0 🌐 Public
lapos
Fork of the Lookahead Part-Of-Speech (Lapos) Tagger
⭐ 5 🌐 Public
latin-macronizer
Script to automatically mark long vowels in Latin texts. Also optionally performs conversion of u to v and i to j.
⭐ 1 🌐 Public
latin_lexica_perseus
Lexica and lemmata for the Latin language, from various sources
⭐ 6 🌐 Public
latin_pos_lemmata_cltk
No description
⭐ 11 🌐 Public
latin_proper_names_cltk
A list of ~40K Classical Latin proper names
⭐ 8 🌐 Public
latin_text_antique_digiliblt
Antique Latin Corpus from digilibLT
⭐ 2 🌐 Public
latin_text_corpus_grammaticorum_latinorum
Collected Latin Data from Corpus Grammaticorum Latinorum
⭐ 4 🌐 Public
latin_text_lacus_curtius
Collected Latin files from LacusCurtius
⭐ 2 🌐 Public
latin_text_poeti_ditalia
Corpus for Italian Poetry in Latin
⭐ 1 🌐 Public
latin_training_set_sentence_cltk
Training sets and tokenizer for the Latin language, for use with CLTK
⭐ 4 🌐 Public
latin_treebank_index_thomisticus
Treebank of the works of Thomas Aquinas
⭐ 0 🌐 Public
latin_treebank_perseus
Latin treebank from the Perseus Digital Library
⭐ 5 🌐 Public
latin_word2vec_cltk
Latin Word2Vec models
⭐ 2 🌐 Public
lat_models_cltk
Trained taggers, tokenizers, etc. for the CLTK
⭐ 10 🌐 Public
lat_text_latin_library
Collected files from thelatinlibrary.com
⭐ 22 🌐 Public
lat_text_perseus
Collected Latin files from the Perseus Digital Library
⭐ 13 🌐 Public
lat_text_tesserae
Plaintext files with Latin texts from the Tesserae Project
⭐ 8 🌐 Public
malayalam_text_gretil
contains malayalam_text
⭐ 0 🌐 Public
marathi_text_wikisource
No description
⭐ 7 🌐 Public
middle_english_text_cmepv
Texts from Corpus of Middle English Prose and Verse
⭐ 2 🌐 Public
middle_high_german_texts
No description
⭐ 0 🌐 Public
morpheus
Morpheus parser
⭐ 1 🌐 Public
multilingual_treebank_proiel
Official releases of the PROIEL treebank of ancient Indo-European languages
⭐ 2 🌐 Public
non_models_cltk
Trained tagger for Old Norse
⭐ 0 🌐 Public
non_texts
Classical Texts from Old Norse Literature
⭐ 0 🌐 Public
old-norse-lemmatizer
No description
⭐ 2 🌐 Public
old_church_slavonic_ccmh
No description
⭐ 1 🌐 Public
old_english_text_sacred_texts
No description
⭐ 2 🌐 Public
old_norse_runes_corpus
No description
⭐ 2 🌐 Public
old_norse_texts_heimskringla
Texts retrieved from Heimskrinla.no for easy use with cltk!
⭐ 2 🌐 Public
old_norse_text_perseus
No description
⭐ 2 🌐 Public
old_swedish_texts
No description
⭐ 0 🌐 Public
pali_models_cltk
No description
⭐ 0 🌐 Public
pali_texts_gretil
No description
⭐ 1 🌐 Public
pali_text_ptr_tipitaka
Pali Tipitaka packaged with the Digital Pali Reader
⭐ 3 🌐 Public
prakrit_texts_gretil
No description
⭐ 1 🌐 Public
punjabi_text_gurban
Punjabi Files of Gurbani
⭐ 4 🌐 Public
sanskrit_parallel_gitasupersite
Parallel corpus
⭐ 11 🌐 Public
sanskrit_parallel_sacred_texts
This Repository contains parallel Sanskrit and English Documents.
⭐ 8 🌐 Public
sanskrit_pos_jnu_tagged
No description
⭐ 2 🌐 Public